Structual Vulnerability of the Nematode Worm Neural Graph 
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The number of connected components and the size of the largest connected component are studied 
under node and edge removal in the connectivity graph of the C. elegans nervous system. By 
studying the two subgraphs - the directed graph of chemical synapses and the undirected graph 
of electrical junctions - we observe that adding a small number of undirected edges dramatically 
reduces the number of components in the complete graph. Under random node and edge removal, 
the C. elegans graph displays a remarkable structural robustness. We then compare these results 
with the vulnerability of a number of canonical graph models. 

PACS numbers: 02.10.Ox, 87.18. Sn, 82.39.Rt, 89.75.Da 



Many natural and artificial systems can be described 
in terms of graphs, assemblies of nodes pairwise related 
through edges pQ. One subject receiving attention in re- 
cent literature is the study of robustness, or resilience, 
of graphs subjected to the removal of either nodes or 
edges 2 4 . In technical systems, such as power grids or 
the internet, a low vulnerability (high resilience) to mal- 
functioning nodes or the interruption of links is essential 
to ensure a continuous and stable service [6] . Many 
of these studies have focused on targeted or guided at- 
tacks, in which nodes/edges are removed depending on 
their degree [51 [7] , betweenness [5J [S] or range pQ , pro- 
viding valuable insights into the structural prerequisites 
ensuring high resilience. These studies also suggest that 
the assessment of various graph-theoretical measures as a 
function of node/edge removal can be used as a valuable 
tool to probe the structural characteristics of graphs. 

We posit here that such a feature of resilience should be 
desirable for biological neuronal networks. Upon removal 
of nodes or edges, a graph will typically decompose into 
an increasing number of smaller connected components - 
assemblies of nodes no longer mutually interlinked. The 
vulnerability of the graph is proportional to the rate of its 
decomposition under node and edge removal. Addition- 
ally, graph robustness can be measured by the relative 
size of its largest (giant) connected component. Graphs 
with a connected component comprised of the majority 
of nodes can be considered to have low vulnerability, as 
this dominant component is more likely to contain the 
assembly of nodes/edges crucial to the graph's function. 

In this letter, we use the number of connected compo- 
nents N cc and the size (defined as the number of nodes) 
of the largest connected component S gcc pQ, to explore 
the structural vulnerability of the nematode Caenorhab- 
ditis elegans neural network with respect to uniform- 
random node and edge removal. In contrast to most 
previous studies, which applied a wide range of graph- 
theoretical measures to the undirected version of the C. 
elegans graph [9], we study its original directed version, 
with only self-loops (accounting for about 0.1% of edges) 
excluded. In directed graphs, two nodes are strongly con- 



nected if a directed path exists through which both nodes 
can reach each other. A set of strongly connected nodes 
forms a strongly connected component, which we will re- 
fer to as the connected component. In the first part of 
the study, both measures are applied to the individual 
subgraphs - the undirected graph of electric junctions 
and the directed graph of chemical synapses. In the sec- 
ond part, we compare the obtained results with several 
canonical graph models. Throughout the study, 10,000 
random realizations at each data point were drawn to 
ensure statistical validity. The numerical analysis was 
performed using custom software, which, along with all 
analysis protocols used, is available for download 10J . 

The neuronal graph of the hermaphrodite worm is, to 
date, the only naturally occurring neural graph for which 
an almost complete wiring diagram has been made avail- 
able [HIE]- The complete graph (CG) is sparse (con- 
nectance Co = N E /N™ ax = 0.038; 0) and consists of 
Nn = 279 nodes with a total of Ne = 2,990 edges and an 
average node in/out-degree of (en) — 10.72. From these, 
514 edges are electric junctions (EJ), forming an undi- 
rected subgraph (Co = 0.013) with (a,) = 3.68, and 2,194 
are chemical synapses (CS) with (<Xj) = 7.86 forming a di- 
rected subgraph (Co = 0.028). The complete C. elegans 
graph has 6 connected components, with the largest com- 
ponent comprised of 274 nodes. The CS (EJ) subgraph 
has 42 (29) connected components with 237 (248) nodes 
in its largest component. Importantly, although the CS 
subgraph contributes more than 73% of edges to the com- 
plete graph, it has 7 times more connected components 
than the complete graph. This number is dramatically 
reduced by the addition of a comparably small number 
(27%) of undirected edges, suggesting a complementary 
or synergistic role of both subgraphs. 

This remarkable synergistic tendency is also observed 
when nodes and edges are removed from the graphs 
(FigJI^i). Denoting with <7{ Sj &} the fraction of occupied 
nodes/edges, both the CS and EJ subgraphs display a 
large N cc , which increases modestly with node/edge re- 
moval, despite the fact that the number of edges de- 
creases almost linearly with the number of removed 
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FIG. 1. (a) N cc , (b) S gcc (dotted; tic marks left), and relative 
S a cc (solid; tic marks far right) as functions of 1 — q{ s ,b} f° r the 
C. elegans graph, and its chemical synapse (CS) and electric 
junction (EJ) subgraphs (mean ± SD). Dotted lines in (a) 
show polynomial- fitted numerical derivatives (dotted), with 
arrowheads indicating PLD (see text). 



nodes/edges (not shown). The complete graph is charac- 
terized by a significantly smaller N cc . Even after removal 
of 50% of nodes/edges, N cc remains comparably small 
(N cc = 13.8 ± 2.3 for node removal, 28.1 ± 2.4 for edge 
removal), a surprising result given the sparseness of the 
C. elegans graph. As removing nodes will naturally yield 
N cc — for q s — 0, for some intermediate value of q s 
a point of largest decomposition (PLD) may be reached, 
defined as the (global) maximum of N cc . A numerical 
difference analysis (Figfl^, dotted lines) shows that the 
PLD for both subgraphs is almost identical (q s ~ 0.4, 
i.e. for 60% of removed nodes), whereas it shifts signifi- 
cantly to lower q s (80% removed nodes) for the complete 
graph, providing further support for the complimentary 
combination of the individual subgraphs. 

A similar resilience is observed with S gcc under 
node/edge removal. As expected, the largest compo- 
nent in the complete graph is always bigger than that 
of its subgraphs, and S gcc decreases quasilinearly with a 
slope of about -1.11 (-1.07 for CS, -1.18 for EJ; FigJTJj, 
dashed). This linear decrease is somewhat surprising 
and more expected from unstructured graphs (e.g. Erdos- 
Renyi graph) as it suggests a homogeneous and isotropic 
connectivity pattern. This gradual linear decrease trans- 
lates directly into an only modest decrease of the relative 
size (fraction of remaining nodes) of the largest compo- 
nent as a function of q s (Fig(T|D, solid). After removal 
of 50% of the nodes, the largest component contains still 
more than 90.3% of the remaining nodes in the graph 



(98% for q s = 1). Edge removal displays similar behav- 
ior, with 90.1% of the nodes remaining in the largest 
component for q^ = 0.5, further corroborating the low 
structural vulnerability of the C. elegans neural graph. 

In order to further explore structural characteristics of 
the C. elegans graph, we applied the above analysis to a 
number of canonical graph models constructed to yield 
the same Nm and about the same Ne as the complete C. 
elegans graph as well as its CS and EJ subgraphs: 

Erdos-Renyi model (ER) [12]. We employed the clas- 
sical directed and undirected ER model, which chooses 
uniform randomly from the ensemble of all graphs with 
a given number of nodes and edges. 

Watts-Strogatz model (WS) [T3]. In this classical ex- 
ample of small-world graphs, average node degrees were 
set to 11 (yielding Ne — 3,069) for the directed model 
matching CG, 8 (N E = 2,232) for CS, and 2 (N E = 558) 
for EJ. A rewiring probability p w = 0.01 was chosen for 
all models to ensure high clustering coefficient and small 
average geodesic graph distance. To construct directed 
WS graphs, we first constructed appropriate undirected 
WS graphs after which the direction of each edge was 
randomized with a probability of 0.5. 

Barabdsi- Albert model (BA) [3]. For random scale-free 
graphs, we used the BA model of preferential attachment. 
The initial number of nodes were 11, 8 and 2 for the CG, 
CS and EJ, respectively. Initial node degrees and incre- 
ments were fixed to 11 (CG), 8 (CS) and 2 (EJ), yielding 
N E = 3,003 (CG), 2,196 (CS) and 555 (EJ). Directed 
versions were obtained from appropriate undirected BA 
graphs and randomization of edge direction. 
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FIG. 2. Average geodesic graph distance (d), global and av- 
erage node clustering coefficient (C D and {Cf}), and node 
in/out-degree distributions for the C. elegans (CE) and a num- 
ber of canonical graph models (see text). The insets show 
log-log plots of the degree distributions. 
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Approximate degree-matched random model (ADM) . In 
order to discern to which extent the resilient behavior of 
the C. elegans graph depends on its node degrees, we 
constructed a model which approximates uniform sam- 
pling from the collection of all graphs with a given degree 
distribution. The adjacency matrix ay was defined as 
dij = 1 if p < ^a° ut a % p, where a | m ' OMt l denote the node 

in/out-degrees of the target graph and a — £\ a\ m,owt ^ , 
and aij = otherwise. A random number p € [0, 1) was 
chosen for each randomly selected index pair with 
i,j E [0,Nn) until Ne edges were generated. 

Exact degree-matched Erdos-Reayi model (EDM). Fi- 
nally, a sophisticated model introduced by Kim and col- 
leagues [T3] was used which provides a biased sampling 
of the set of graphs with specific node degrees. 

We first compared the average geodesic graph distance 
(d) [I], the global clustering coefficient C D , the aver- 
age node clustering coefficient (C/ 3 ) [15] and node degree 
distributions in the various models. As expected, none 
of the canonical models (ER, WS, BA) reproduced the 
node degree distribution observed in the C. elegans graph 
(Figj^). The latter was, however, well approximated by 
the ADM model. Somewhat surprisingly, the low value 
of (d) was shared among all models, with the exception of 
WS (Figj2ji). Although the WS model allows adjustment 
of this value through a change of its rewiring probabil- 
ity p w , a lower value of (d) can only be achieved on the 
expense of a reduction in its small- worldness index |16j . 
defined here for directed graphs as 
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where C® Dm and (d) EDM are the corresponding values for 
the EDM model. Choosing p w = 0.2 yields (d) = 2.87 
and C D = 0.19, values comparable to that observed in 
the C. elegans graph. However, S D will decrease from 
4.47 ± 0.03 for p w = 0.01 to 3.73 ± 0.05 for p w = 0.2 
(S D = 2.98 for C. elegans, 1.77 for BA, 0.83 for ER). In 
this study, we choose to explore a WS model in which S D 
is approximately maximal. Our findings suggest that a 
characterization of the C. elegans neural graph as small- 
world or scale- free should receive caution [T7], although 
further study is necessary to support this conclusion. 

Further support for this cautious assessment is pro- 
vided by the vulnerability analysis. None of the canon- 
ical graph models was capable of reproducing the syn- 
ergy between the subgraphs which gives rise to the high 
resilience observed in the complete C. elegans graph 
(Figj3j). The N cc for large q{ Si b} wa s significantly lower 
than in C. elegans by construction, and increased rapidly 
after reaching a critical node/edge occupation probabil- 
ity q{s,b} ~ 0.4 (ER, WS) or 0.6 (BA) for CG. Such a 
critical behavior was not observed in C. elegans. Fur- 
thermore, the PLD in these canonical models displayed 
a clear dependency on the average node degree, with the 
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FIG. 3. N cc (solid; tic marks left) and relative S gcc (dashed; 
tic marks far right) for node (left) and edge (right) removal 
in canonical graph models (see text). Models corresponding 
to the complete graph and the CS/EJ subgraphs are shown 
as in Fig. [l] Insets display the absolute size of the the largest 
connected component. Arrowheads indicate PLD. 



EJ subgraph showing the lowest PLD. Interestingly, both 
ADM and EDM captured, for CG, the dependence of N cc 
on <Z{ Sj {,} seen in C. elegans (Figlil). While both subgraphs 
of ADM deviated significantly from C. elegans, only the 
CS of EDM differed, with EJ capturing the behavior of 
the biological graph under node/edge removal. Taken to- 
gether, these observations suggest that while the degree- 
matched graphs provide the best model for the structural 
decomposition of C. elegans, there remains a small but 
significant structural component which is not explained 
by a degree-matched random connectivity alone. 

Differences between C. elegans and canonical graph 
models were also prominent when considering S gcc . In 
neither ER, WS or BA a quasilinear dependency of S gcc 
was observed, but a clear supralinear decrease, most 
significant in the WS model (Fig. [3j insets). For the 
ER model this finding is particularly surprising, as its 
isotropy and homogeneity should ensure a decrease of 
Sgcc proportional to q s . Deviations from this expectation 
must be attributed to the small size of the graph and its 
low connectance. This behavior of S gcc is reflected in 
its relative measure. Here, after reaching a critical oc- 
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FIG. 4. N cc (solid; tic marks left) and relative S gC c (dashed; 
tic marks far right) for node (left) and edge (right) removal 
in the ADM and EDM model (cf. Fig.|3). 



cupation probability, the relative S gcc decreases rapidly, 
and the CS and EJ subgraphs show significant quanti- 
tative differences, a finding not observed in C. elegans. 
Finally, the similarity in the dependency of S gcc on q{ s .b} 
in the ADM and, especially, the EDM when compared to 
the corresponding biological graphs strongly supports the 
conclusion that the structural resilience of the C. elegans 
neural graph is driven by its degree distribution. 

In summary, this letter addressed the structural vul- 
nerability of the nematode worm neural graph by study- 
ing the behavior of the number of connected components 
and the size of the largest connected component as func- 
tion of uniform random removal of its nodes and edges. 
We observed a strong structural resilience in the C. ele- 
gans graph as well as its directed chemical synapse and 
undirected electric junction subgraphs, under both node 
and edge removal. This is surprising given the low con- 
nectance of these graphs. Furthermore, the robustness of 
the residual largest component to perturbations in con- 
nectivity suggests that the C. elegans neural system may 
remain functional even after removal of a significant frac- 
tion of its neurons or synapses, as expected from natu- 
rally systems exposed to detrimental external influences. 

Our study showed further that the chemical synapse 
and electric junction subgraphs of the C. elegans display 
a remarkably similar behavior in both measures, despite 
significant differences in connectance, and that the com- 
plete neural graph is more than a simple combination of 
the two subgraphs. This observation was not shared in 
the investigated canonical graph models. Only models of 
degree-matched random graphs reproduced the various 
structural features of C. elegans studied here. This sug- 



gests that the structural vulnerability of the nematode 
neural graph is critically determined by its node degree 
distribution, and less by specific connectivity properties 
such as scale-free- or small-world-ness. Based on the re- 
sults of this study we cautiously conclude that the clus- 
tering coefficient and geodesic path length alone are not 
sufficient for a characterization of the C. elegans neural 
and other real- world graphs, and that with respect to the 
measures of structural vulnerability considered here, the 
C. elegans graph is best modeled by an (exact) degree- 
matched random graph. 

Our study does not address in detail whether the syn- 
ergistic behavior of the combination of an undirected and 
directed subgraph can be reproduced by a simple increase 
in the number of edges, or whether it is the product of 
a complementary mixing of an undirected and directed 
graph. The results presented here suggest the latter, as 
the addition of only a small number of undirected edges 
significantly reduces the structural vulnerability of the C. 
elegans neural graph. By taking into account the degree 
distributions of individual subgraphs, a specific mecha- 
nism for synergistic combination of directed and undi- 
rected graphs in constructing biological neural graphs 
could be conceived, and will be considered in future work. 



[9 

[10 
[H 
[12 

[13 
[14 

[15 
[16 

[17] 



Electronic address: michelle.rudolph@unic.cnrs-gif.fr 
S. Boccaletti et al, Phys. Rep. 424, 175 (2006); M.E.J. 
Newman, Networks: An Introduction (OUP, 2010). 
R. Albert et al, Nature 406, 378 (2000); K. Goh et al, 
PNAS 99, 12583 (2002); V. Latora and M. Marchiori, 
Phys. Rev. E 72, 015103 (2005). 

R. Albert and A.-L. Barabasi, Rev. Mod. Phys. 74, 47 
(2002). 

A. E. Motter et al, Phys. Rev. E 66, 065103 (2002). 

K. Atkins et al, Int. J. Crit. Infrastruct. 5, 265 (2009); 
S.M. Wagner and N. Neshat, Int. J. Prod. Econ. 126, 
121 (2010). 

R. Albert et al, Phys. Rev. E 69, 025103 (2004). 

P. Crucitti et al, Physica A 320, 622 (2003). 

P. Holme et al, Phys. Rev. E 65, 056109 (2002); M. 

Girvan and M.E.J. Newman, PNAS 99, 8271 (2002). 

L.R. Varshney et al, PLoS Comput. Biol. 7, el001066 

(2011). 

|http://www.cydyns.com| 

B. L. Chen et al, PNAS 103, 4723 (2006). 

P. Erdos and A. Renyi, Publ. Math. Inst. Hung. Acad. 
Sci. 5, 17 (1960). 

D.J. Watts and S.H. Strogatz, Nature 393, 440 (1998). 

C. I. Del Genio et al, PLoS One 5, el0012 (2010); H. Kim 
et al, New Journal of Physics 14, 023012 (2012). 

G. Fagiolo, Phys. Rev. E 76, 026107 (2007). 

M.D. Humphries and K. Gurney, PLoS One 3, e0002051 

(2008). 

G. Lima-Mendez and J. van Helden, Mol. BioSyst. 5, 
1482 (2009). 



